Comparison of Topic Language Models for Query Disambiguation in Information Retrieval
نویسنده
چکیده
A long-standing challenge in information retrieval is to disambiguate query words for more precise search results. However, two or more meanings of a word in a query, or polysemy, deteriorate the precision effectiveness of information retrieval systems. There is a need for correct and effective information retrieval in many information systems such as health care and customer relationship management. This paper examines three topic language models that are mentioned in the literature for their ability to handle polysemy in query words. The three topic lanauge models are--latent semantic analysis, probabilistic latent semantic analysis, and latent Dirichlet allocation. We review these models and compare their performance in query disambiguation. Our study provides guidance on the use of these models in information retrieval systems.
منابع مشابه
Improved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملTranslation Probabilities in Cross-language Information Retrieval
Translation ambiguity is a major problem in dictionary-based cross-language information retrieval. To attack the problem, indirect disambiguation approaches, which do not explicitly resolve translation ambiguity, rely on query-structuring techniques such as a structured Boolean model and Pirkola’s method. Direct disambiguation approaches try to assign translation probabilities to translation eq...
متن کاملQEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...
متن کاملTopic Level Disambiguation for Weak Queries
Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queri...
متن کامل